Erratum: Bayes Performance of Batch Data Mining Based on Functional Dependencies
نویسندگان
چکیده
منابع مشابه
Data Analysis Based on Functional Dependencies
We present a novel approach to the design of schemas appropriate for data analysis; we call such schemas “analysis contexts”. Roughly speaking, an analysis context is a set of paths with common origin, in which the nodes are attributes and the edges are functional dependencies. Our main contributions are (a) an algorithm for generating all analysis contexts embodied in a set of attributes and a...
متن کاملMining Constant Conditional Functional Dependencies for Improving Data Quality
This paper applies the data mining techniques in the area of data cleaning as effective in discovering Constant Conditional Functional Dependencies(CCFDs) from relational databases . These CCFDs are used as business rules for context dependent data validations. Conditional Functional Dependencies(CFDs) are an extension of Functional dependencies(FDs) which captures the consistency of data by su...
متن کاملSemandaq: a data quality system based on conditional functional dependencies
We present SEMANDAQ, a prototype system for improving the quality of relational data. Based on the recently proposed conditional functional dependencies (CFDs), it detects and repairs errors and inconsistencies that emerge as violations of these constraints. We demonstrate the following functionalities supported by SEMANDAQ: (a) an interface for specifying CFDs; (b) a visual tool for automated ...
متن کاملDiagnosis of diabetes by using a data mining method based on native data
Background & Aim: Detecting the abnormal performance of diabetes and subsequently getting proper treatment can reduce the mortality associated with the disease. Also, timely diagnosis will result in irreversible complications for the patient. The aim of this study was to determine the status of diabetes mellitus using data mining techniques. Methods: This is an analytical study and its databas...
متن کاملData sanitization in association rule mining based on impact factor
Data sanitization is a process that is used to promote the sharing of transactional databases among organizations and businesses, it alleviates concerns for individuals and organizations regarding the disclosure of sensitive patterns. It transforms the source database into a released database so that counterparts cannot discover the sensitive patterns and so data confidentiality is preserved ag...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: International Journal of Pattern Recognition and Artificial Intelligence
سال: 2019
ISSN: 0218-0014,1793-6381
DOI: 10.1142/s0218001419920022